Search CORE

11 research outputs found

Piecewise Latent Variables for Neural Variational Text Processing

Author: Courville Aaron
Ororbia II Alexander G.
Pineau Joelle
Serban Iulian V.
Publication venue
Publication date: 01/01/2017
Field of study

Advances in neural variational inference have facilitated the learning of powerful directed graphical models with continuous latent variables, such as variational autoencoders. The hope is that such models will learn to represent rich, multi-modal latent factors in real-world data, such as natural language text. However, current models often assume simplistic priors on the latent variables - such as the uni-modal Gaussian distribution - which are incapable of representing complex latent factors efficiently. To overcome this restriction, we propose the simple, but highly flexible, piecewise constant distribution. This distribution has the capacity to represent an exponential number of modes of a latent target distribution, while remaining mathematically tractable. Our results demonstrate that incorporating this new latent distribution into different models yields substantial improvements in natural language processing tasks such as document modeling and natural language generation for dialogue.Comment: 19 pages, 2 figures, 8 tables; EMNLP 201

arXiv.org e-Print Archive

Crossref

Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models

Author: Bengio Yoshua
Courville Aaron
Pineau Joelle
Serban Iulian V.
Sordoni Alessandro
Publication venue
Publication date: 05/03/2016
Field of study

We investigate the task of building open domain, conversational dialogue systems based on large dialogue corpora using generative models. Generative models produce system responses that are autonomously generated word-by-word, opening up the possibility for realistic, flexible interactions. In support of this goal, we extend the recently proposed hierarchical recurrent encoder-decoder neural network to the dialogue domain, and demonstrate that this model is competitive with state-of-the-art neural language models and back-off n-gram models. We investigate the limitations of this and similar approaches, and show how its performance can be improved by bootstrapping the learning from a larger question-answer pair corpus and from pretrained word embeddings.Comment: 8 pages with references; Published in AAAI 2016 (Special Track on Cognitive Systems

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Towards an Automatic Turing Test: Learning to Evaluate Dialogue Responses

Author: Angelard-Gontier Nicolas
Bengio Yoshua
Lowe Ryan
Noseworthy Michael
Pineau Joelle
Serban Iulian V.
Publication venue
Publication date: 01/01/2017
Field of study

Automatically evaluating the quality of dialogue responses for unstructured domains is a challenging problem. Unfortunately, existing automatic evaluation metrics are biased and correlate very poorly with human judgements of response quality. Yet having an accurate automatic evaluation procedure is crucial for dialogue research, as it allows rapid prototyping and testing of new models with fewer expensive human evaluations. In response to this challenge, we formulate automatic dialogue evaluation as a learning problem. We present an evaluation model (ADEM) that learns to predict human-like scores to input responses, using a new dataset of human response scores. We show that the ADEM model's predictions correlate significantly, and at a level much higher than word-overlap metrics such as BLEU, with human judgements at both the utterance and system-level. We also show that ADEM can generalize to evaluating dialogue models unseen during training, an important step for automatic dialogue evaluation.Comment: ACL 201

arXiv.org e-Print Archive

Crossref

Deep Discourse Analysis for Generating Personalized Feedback in Intelligent Tutor Systems

Author: Belfer Robert
Cheung Jackie C. K.
Grenander Matt
Kochmar Ekaterina
Serban Iulian V.
St-Hilaire François
Publication venue: Association for the Advancement of Artificial Intelligence
Publication date: 13/03/2021
Field of study

We explore creating automated, personalized feedback in an intelligent tutoring system (ITS). Our goal is to pinpoint correct and incorrect concepts in student answers in order to achieve better student learning gains. Although automatic methods for providing personalized feedback exist, they do not explicitly inform students about which concepts in their answers are correct or incorrect. Our approach involves decomposing students answers using neural discourse segmentation and classification techniques. This decomposition yields a relational graph over all discourse units covered by the reference solutions and student answers. We use this inferred relational graph structure and a neural classifier to match student answers with reference solutions and generate personalized feedback. Although the process is completely automated and data-driven, the personalized feedback generated is highly contextual, domain-aware and effectively targets each student's misconceptions and knowledge gaps. We test our method in a dialogue-based ITS and demonstrate that our approach results in high-quality feedback and significantly improved student learning gains

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Graph Convolutional Network with Sequential Attention for Goal-Oriented Dialogue Systems

Crossref

Bi-Directional Recurrent Attentional Topic Model

Author: Ahmed Amr
Bahdanau Dzmitry
Blei David M.
Bottou Léon
Chen Zhiyuan
Chorowski Jan K.
David
Dieng Adji B.
Gan Zhe
Geoffrey
Griffiths Thomas L.
Gruber Amit
Henao Ricardo
Hoffman Matthew D.
Jon
Jordan
Kiros Ryan
Lai Siwei
Larochelle Hugo
Le Quoc
Li Shuangyin
Marlin Benjamin M.
Mikolov Tomas
Mikolov Tomas
Mikolov Tomas
Newman David
Nitish Srivastava
Rong Pan
Serban Iulian V.
Shuangyin Li
Srivastava Nitish
Srivastava Nitish
Sutskever Ilya
Tang Jian
Wei Xing
Xu Mingyang
Yang Min
Yu Zhang
Zhai Ke
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref

Modeling Speech Acts in Asynchronous Conversations: A Neural-CRF Approach

Crossref